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1 AGCTTTATAA CCATGTGATC CCATCTTATG GTTTCAATCC ATGCACAGGA 

51 GGAAAATTGT GGGCACGAAG TTTCCAAAGG GAAAATTTAT AGATTGGTAG 

101 TTAATGAAAT ACAGTTTTCC TCCTTGGCAA ATTTAATTTA CTAGCTTCAC 

151 TGTATAGGAA AAAGCAGGAA AAAAATTAAA ACCAACTCAC CTCCAAACCT 

201 GTTTTGAGCT TTTACTTGTC TGCCCAATTG ATAGTTTCTA CTCTCTGCTT 

251 TTGATGAAAA TATTTTTTAT TATTTTAATG TAACTTCTGA AAACTAAATT 

301 ATCTAGAAGC AAATAAAAAG ATATTGCTTT TATAGTTCCC AGAAGGAAAA 

351 AACAAACACT AGGAAAGTTC TATCTATCAG ATGGGGGAGA TGTGATGGAG 

4 01 GCAGTGATAT TTGAGCTGAG CCTTGAACAA TGAACAGGAG TCTACCAAGC 

4 51 GAGAGGCTAG CGGGTGGCCC TCAAGATAAA ACAACAGCAT GTACAAAGGC 

501 ATGGAGACAT ACACATCTTG ACTCTTCCAG GAATGGTGGG AACGCTGGTG 

551 GAGCTAGAAT GTAGGTACAT AGCATAAAGT GGCAGACGGG AAGCCTTTGG 

601 AAATCTTATT ACATAGGACC CTGGATGCCA TTCCAATGAC TTTGAATTTT 

651 CTGTAGGCTG CCAGCGAAAT TTCCAAGCGT GATAGAGTCA TGTCTATCTA 

701 TGCACTTCAG AAAGACAACC TCAGGGTTAA TGAAGAAAAT GCATTGGAAT 

751 ATAAGAAACT GGTGACCAGA GTGATCAATT GCATGACTGT TGTGAAAGTC 

8 01 CAGGTGAGGG GAGCTGTGGG CAAGGTCAGA GTTGAGAGGC ATTTCAGAGA 

851 TAAAATGACA GTAACTAAGT AGATGTCAGG CTGAGAAGAA AGGGCTGTAC 

901 CAGATATATG GTGCTATCAT TAAGTGAGCT CAACATTGCA GAAAAGGGGT 

951 AGGTTTGGTG GGAGTTGCTC ACAAAACATG TTTAGTCTAA GCAAAACCAT 

1001 TGCCATGGGC TCAGATAAAA GTTAAGAAGT GGAAACCATT CCTACATTCC 

1051 TATAGGAGCT GCTATCTGGA AGGCCTAGTA TACACGTGGC TTTTCAGCTG 

1101 TGATTTTGTT TGATTTTAGG GATTATTCTT TTTCTGAATC TGAGCAATGT 
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1151 TAGCGTGTAA AATACTCACA CCCACAGCTT TGACTGGGTG AGAAGTTATC 

1201 ATAAATCATA TTGAGTTTGT TGTGATACCT TCAGCTTCAA CAAGTGATGA 

12 51 GTCAGGTCAA CTCCATGTGA AAGTTCCTTG CTAAGCATGC AGATATTCTG 

1301 AAAGGTTTCC TGGTACACTG GCTCATGGCA CAGATAGGAG AAATTGAGGA 

1351 AGGTAAGTCT TTGACCCCAC CTGATAACAC CTAGTTTGAG TCAACCTGGT 

14 01 TAAGTACAAA TATGAGAAGG CTTCTCATTC AGGTCCATGC TTGCCTACTC 

14 51 CTCTGTCCAC TGCTTTCGTG AAGACAAGAT GAAGTTCACA GTGAGTAGAT 

1501 TTTTCCTTTT GAATTTACCA CCAAATGATT GGAGACTGTC AATATTCTGA 

1551 GATTTAGGAG GTTTGCTTCT TATGGCCCCA TCATGGAAAG TTTGTTTTAA 

1601 AAAAATTCTC TCTTCAAACA CATGGACACA GAGAGGGGAA CAACACACAC 

1651 CAGGTCCTGT TGGGGGGTGG AGAGTGAGGG GAGGGAACTT AGAGGACAGG 

17 01 TCAATAGGGG CAGCAAACCA CCATGGCACA CATATACCTA TGTAACAAAC 
1751 CTGCACGTTC TGCACATGTA TCCCTTTTTT TTAGAAGAAG AAATAATGAA 
1801 AAAAAACCTT TTTTCTATTT ATATAATCAT GGCATTTATA AGCATCTCTA 

18 51 TAGAGAAGGA TAATTGTGCT GAGATTAGAC AGCTGTCTGA GCACCTCACA 
1901 CTGACCTATT TTTAACAAAA TGACTTTCCA CATCACCTGA TTTCGGCTCC 
1951 ATGCRGGGTA AGCAGTTCCT AAGCCCTAGA AAGTGCCGAT CATCCCTCAT 
2001 TCTTGAATTC CTCCTTTTAT TTACCAAAAT TCCTGAGCAT GTTCAGGAAA 
2 051 GATGAAAAGC TTATTATCAA AATAAGTGGC TGAGATAGAC TTCTTGTCAC 
2101 ATTTGTTACA GTAAAATGGG TCTCCAAGAA AGAAAGATTT GCCTTGGGCT 
2151 CTAGCATGGC CATTTATTTA AGAAAGCATC TGAAACATGA AGCTACCACA 
2201 GCATCTCTCC TGTGGTTCCA GACGGAAGCC TGAGAGTCTA GGAGGAGGTG 
2251 GACCGAGAAA CCCTGCCAAA GTAACTAGTA GTGCCGGGTT TCTCACAACA 
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2301 CGATGCAAAG GGGCTAGAAT CAGATGACTA TTTTCATGTT TCAACATACT 

2 351 ACACACTGGA AAACGTTACG GCAGACTCTA CTTTATAATG GGGCTGCAAA 

24 01 TGTAAAATGA CTACTAGAAC TAGGTCCTCT TAATAGCAGC AAAGTTTAAA 

24 51 AGGGTCAGAG GGAGCTCCAG ACACAGGTTA GATTTGATTT CTCTCCTAGT 

2501 TCTGCTGTGA ACAAGAGGTA TAAGTTTGGC CAACTCACTT AACCCCTGAA 

2551 GCTCAGTTAC CTTATCTGTA AAATGATTGC ATTGTACTAG GTGTTCTCTA 

2601 AAATTTCTTC TACCTCTGAC TTTTTAGGAG ACTAATTTTT AACTCCTTTT 

2 651 TAAGCTATTG GGAGAAAAAT TTAATTTTTT TTCAAAAGTT ACCTTGAATC 

2701 TCTAGAGCAG TTCTCAAAAC TATTTTGTCC CAGGCAAAGG AAATGAGACT 

2751 AGGTACCCAG AATGAGGCAC CCTGCATAAA GCTCTGTGCT CTGAAAACCA 

2 801 ATGTCAGGGA CCCTGTGATA AAT AAT T AAA CCAAGTATCC TGGGACACTG 

2851 CTAGTGACAT CGCCTCTGCT GATCACTCTT GCCAGCGAGA CACTCTATAC 

2 901 TTGCTTTCTC ATCATTGGCA TCCAAACTGC CTACTAATCC ATTGCTTTGG 

2 951 AAAGTTTTTT TTAATAAAAA GATTATTTCT AT TAG GAG G A AAACATCCCA 

3001 TGTTAAATAG GAAAATTAAC TGAAATCATT TTCAGATGTG ATTTTTAGCA 

3051 CTTATAGCCA TTTCAAACCA TGGTATTCAT TTATACTATG CTATTTATTG 

3101 TAAAACTTCT TTTTTTTTCC AAGGAAAATA AGATAGTTTG CTTTATTTTA 

3151 AAACAGTAAC TTTCTTATAT TGGGGCACTG ACCAAAATTC AATACTGGTA 

3201 CAAATATGTT ACCTAGGGGG TCAAAATATG TGCCAGGTGA ATTTTCTGAA 

3251 TTTCTCTAAA GAGAGAATTT TAAACCTTAT AAAAC AAT T A GAAACAAGTG 

3301 AGTGAGAGGT GAGCATCAAC AACCTGTGTA ACATAAGCCA CAGTACAAAT 

3351 TTAAGCTGAA TAACCAAGCC ATGTCAGTTA TCCCAAATCA TTTTTGTTAA 

34 01 TATTTAGGAG GATACACATA TTTTCAATAA CTTAAAAGTG AATCTTTACT 

3451 CCTATCTCTT AATACTCGAA GAAGTATAAC TTTCTTCTTT TACTAGATTT 
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CAAGGTAGGA 
AATCATGAAG 
CTTGAAAGCT 
ATGGTTTATA 
ATAATTTATA 
ATATTTTGAC 
AAGTATTATC 
AGAAACAAGA 
ATAATTTGAT 
ATGGCAATGA 
TCAATTTTTC 
ATATCACATT 
GTTAAAACCT 
ATTGAACTAG 
GCTTGGATAG 
CAATGTTATT 
TCTAGCTCCT 
TACCAAAATG 
TTATAAACTT 
GTGGGCAGCA 
GACAATAACA 
TGTAGGTAGT 
CTTTTTAACA 
TCTAATTACA 
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4 7 01 AAAGGTGACC ATTTTGCTCA TCCGATTTTG TTCCCTAGAG ATAACTACTG 

4 7 51 TTAATAAGTG TTGCATGATC AGTTAAAATT CAAACCAACA AACACTATGT 

4 8 01 TCAAGGGATT GTGGGTATAT ACAACAAATA TGAACATCCT TTTGCCTTGC 

4 8 51 CTGCAGATAC CCTCAATAAT GCTGAAAGAC TTATACAACA TTACTGCTTC 

4 901 CAAAGCTTAG ACTATCTCAC TTTGTTTTCA AAGGAGGTTT TACGACCTTC 

4 951 TAAAGAGATT GAAATTGACA TTTCACCTAA AACTCGGGAA ATGTAAATGA 
5001 CAATATTAAT TGGTAAGAGA GGAAAGAAGA AAGAAAGAAG GAAGGAAAGA 

5 051 AAGAAAGAAG GAAGGAAGGA AAGAAAGAAA GAAAGAAAGA AAGAGAGAGA 
5101 AAGAAAGAAA AAGAAAAAAG AGAGAAAGAG AGAAGGAAAG AAAGAGAGAA 
5151 GGAAAGGAAA AGAGAAGCAA AGAAAGAGAG GAGCAAAGAA AGGAACACTT 
52 01 AGCACTAGTT GGGAGACCCA ACTCTGGAAT TATCAGCTAT ATATTTAACA 
5251 AACGTTATAC TTTTAAATAG CAAACTCTTT ATTGTTTCAA TTTTATCTGG 
5301 TCAATTGGAA AAATAATTTT TGTCTTATCT GTCTCCTTGA AATGTGAGGA 
5351 TCAAAGGAGA CTAAAACATG ATAGCTTTTA AAGTCTATTT CAGTAAAACA 
54 01 GACTTATATA GAGGGGTTTT TATCATGCTG GAACCTGGAA ATAAAGCAAA 
54 51 CCAGTTAGAT GCTCAGTCTC TGCCCTCACA GAATTGCAGT CTGTCCCCAC 
5501 AAATGTCAGC AAT AG AT AT G ATTGCCAAGC AGTGCCCCAT CCAGTGCTCT 
5551 TATCCCAGCT CAT C AC GAT C TTGGAGTTCC CATTTCTCTC TGCAGGTGGA 
5 601 ACTGACCTCT GATAAGAAAA GCTCCTCGGA GAACACATGC CTCACTATTT 
5 651 GCCATCTACT TTAACAGGGC TTTGCTGCAA CCAGACTCTT TCAAAAGAAG 
57 01 ACATGCATTG TGCACAAAAT GAACAAGGAA GTCATGCCCT CCATTCAATC 
57 51 CCTTGATGCA CTGGTCAAGG AAAAGAAGGT AAAAATAAAA GGCTTTTTAT 
5801 TTTTGGTGAG GGGAGAGGTT TTACATCCTT CAGTAAATAA CGAGAAGATC 
5851 ACAGTCATTC CCTCTTGACT ACAGTATGTT GTAGTGTGCA GCACAAAGGG 
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5 901 GGAAGTTATT GGTGATTGCC 

5 951 GCTGTGGCTC ACACCTACCT 

60 01 TTGACTGTTA GAGGAGCAAA 

6051 CTTTGCCAAG GACATTTTTC 

6101 GCAAATGAAA ATAGAAGATA 

6151 CAGGTCTTTT TAGTATAATA 

62 01 AAAAG AT C C A TTGTTATCTG 

62 51 TTGTCTGGCC ATACTAATCC 
6301 TTTAAAATAT TTGCCTGAGT 

63 51 TTTCATTGAC AAGGTAGAGA 

64 01 TTATTACATA GTCTAATCCA 

64 51 GTTGCAGAAT AAATTAAGCC 

65 01 TTTCAAACCA GGTAAATCCC 
65 51 GAATAAGCTA GAAATGGCTC 
6601 TGAGCAACAC TCACTGTTGG 
6651 TGGGCTTGGC ATTCATTCTC 
67 01 ACACTCAGCT TCAGGGTAAG 

67 51 ATGTACTCAG TCAACCCAAA 

68 01 AAACATTGCA AACATGTGTC 
68 51 TGCAAGGTGA GTAGCATCCC 
6901 ATTGTCAGAC TATCCTCGCG 
6951 GATGGTCATC AAGGCCAACA 
7001 TGTTCTATGT GCTTTAAGTG 
7051 AGGTTCTATT TTAATCCTAC 
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TGAGGGAAGG CAACTTCTGC CACATCAAAT 
CTACAACCGC TGAGCAAAGC ACTTGAAACC 
GCTCTGGTCA CACCAATAGG AGCCTCAGTA 
TGCAAGAGTT AGTTAGGGTT ATTAGATTTA 
TCCAGTTAGG TTTGAATTTT AGGTAAGCAG 
TATCCTATGC AATATTTGGG ATATACTAAA 
AAATTCAAAT GTAACTGGGT ATTGTATATT 
AGGTGAGTGG AAAGAAGAGA TCCATAATGT 
TCATATTCCT ATAACTGATA AATGAGTACC 
AAATAAATAA ACTGCATTCT CAGAAGATGA 
AGGAATCTAT GATGACCAAA TGAGGTCCAA 
TCAGACTTCT GTGTTTATGA GAAGCTGAGG 
TTAGGACACT TAGAAATGCT AAGATATACA 
TTCTTCATCT TGATTATGGA AAAATTTAGC 
CCTCGTATAC CCCTCAAGTC AACAAACCAC 
TCCCATTCTT CCTTTCTACC TCTCTTTTCC 
GGACCAGGAG GACCACCTCC CAAGGGCCTG 
CAAAGTCGAT GACCTGAGCA AGTTCGGAAA 
GTGGGATTCC AAC AT AC AT G GCTGAGGAGA 
TACTGTGCAC CCCAAGTTAG TGCTGGTGGG 
CGTGTCCATA GTGGGCACCA GTGATGCAGG 
TTTGTGCAGT GCTTGCTCTG TGCCAGGTAC 
TGTTAACTCG GTTCTTCACA GCAATCTTAT 
TTTATGGATG AGGAAACTGA GGTACAGAGA 

FIG. 1F 
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-5!$/ 7101 GGTCACAAAA TCCTTGCCTG GGTCAATTCC AAGCATTTTG GCTGTGGATT 

7151 CTGTGCTCTT AAATATTATG GAACACTGCC TTTTAAGTGT GAATCAAGAG 

72 01 TAGACTCAAG TCATATTCAA AAGAATGCAT GAATGGCTAA ATGAAAGAAG 

7251 AATGCTAATA GAATCTATTA ACTTTCTATA GCTCAGACAA TCACTTAATT 

7 301 TCTGGACATT CAAAGAACAG CTGCACACAA ACAAAGTGTC TACCTAGGGA 

7351 CCTAACTTAA TGGCAATTTT CCAGATCTCT GAATTGATTG ATTTCATCAC 

7 4 01 AACAAGTAGA TAAACCTTGA CATTAGCACA TAGCTAGTTT GGAAACCCCT 

7451 ACTCCCCCAA TCCCCTCCAA GAAAAGAGTC CTTAAATAGA CATTAATATA 

7501 GGCTTCTTCT TTTCTCTTTA TTAGAGGCAA GCCTGTTTTT TTACTCAGGA 

7551 ACGTGCTACA CGACCAGTGT ACTATGGATT GTGGACATTT CCTTCTGTGG 

7 601 AGACACGGTG G AG AAC T AAA CAATTTTTTA AAGCCACTAT GGATTTAGTC 

7 651 ATCTGAATAT GCTGTGCAGA AAAAATATGG GCTCCAGTGG TTTTTACCAT 

7701 GTCATTCTGA AATTTTTCTC TACTAGTTAT GTTTGATTTC TTTAAGTTTC 

77 51 AATAAAATCA TTTAGCATTG AATTCAGTGT ATACTCACAT TTCTTACAAT 

7801 TTCTTATGAC TTGGAATGCA CAGGATCAAA AATGCAATGT GGTGGTGGCA 

7 851 AGTTGTTGAA GTGCATTAGA CTCAACTGCT AGCCTATATT CAAGACCTGT 

7 901 CTCCTGTAAA GAACCCCTTC AGGTGCTTCA GACACCACTA ACCACAACCC 



7 951 TGGGAATGGT TCCAATACTC TCCTACTCCT CTGTCCACTG CTTAA 
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CATGCTTGCC TACTCCTCTG TCCACTGCTT TCGTGAAGAC AAGATGAAGT 

1 TCACAATTGT CTTTGCTGGA CTTCTTGGAG TCTTTCTAGC TCCTGCCCTA 

1 GCTAACTATA ATATCAACGT CAATGATGAC AACAACAATG CTGGAAGTGG 

1 GCAGCAGTCA GTGAGTGTCA ACAATGAACA CAATGTGGCC AATGTTGACA 

1 ATAACAACGG ATGGGACTCC TGGAATTCCA TCTGGGATTA TGGAAATGGC 

1 TTTGCTGCAA CCAGACTCTT TCAAAAGAAG ACATGCATTG TGCACAAAAT 

1 GAACAAGGAA GTCATGCCCT CCATTCAATC CCTTGATGCA CTGGTCAAGG 

1 AAAAGAAGCT TCAGGGTAAG GGACCAGGAG GACCACCTCC CAAGGGCCTG 

1 ATGTACTCAG TCAACCCAAA CAAAGTCGAT GACCTGAGCA AGTTCGGAAA 

1 AAACATTGCA AACATGTGTC GTGGGATTCC AACAT AC AT G GCTGAGGAGA 

1 TGCAAGAGGC AAGCCTGTTT TTTTACTCAG GAACGTGCTA CACGACCAGT 

1 GTACTATGGA TTGTGGACAT TTCCTTCTGT GGAGACACGG TGGAGAACTA 

1 AACAATTTTT TAAAGCCACT ATGGATTTAG TCATCTGAAT ATGCTGTGCA 

1 GAAAAAATAT GGGCTCCAGT GGTTTTTACC ATGTCATTCT GAAATTTTTC 

1 TCTACTAGTT ATGTTTGATT TCTTTAAGTT TCAATAAAAT CATTTAGCAT 

1 TG 
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MKFTI VFAGLLGVFLAPALANYNI DVNDDNNNAGSGQQSVSVNNEHNVAN 50 



51 VDNNNGWDSWNSIWDYGNGFAATRLFQKKTCIVHKMKKEVMPSIQSLDAL 100 

101 VKEKKLQGKGPGGPPPKGLMYSVNPNKVDDLSKFGKNIANMCRGI PTYMA 150 



151 EEMQEASLFFYSGTCYTTSVLWI VDISFCGDTVEN 185 



FIG. 3 
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1 GAATTCAAAC AGCAGGCCAT CTTTCACCAG CACTATCCGA ATCTAGCCAT 

51 ACCAGCATTC TAGAAGAGAT GCAGGCAGTG AGCTAAGCAT CAGACCCCTG 

101 CAGCCCTGTA AGCTCCAGAC CATGGAGAAG AGGAAGGTTG TGGGTTCAAG 

151 GAGCTTTTCA GAGTGGAAAT CTGTGGATCA GTGATTTATA AAACACAGTT 

2 01 TCCCCCTTTA TTAGATTTGA ACCACCAGCT TCAGTTGTAG AAGAGAACAG 

2 51 GTTAAAAAAT AATAAGTGTC AGTCAGTTCT CCTTCAAAAC TATTTTAAAC 

301 GTTTACTTAT TTTGCCAAGT GACAGTCTCT GCTTCCTCTC CTAGGAGAAG 

351 TCTTCCCTTA TTTTAATATA ATATTTGAAA GTTTTCATTA TCTAGAGCAG 

4 01 TGGTTCTCAT CCTGTGGGCC ATGAGCCCTT TGGGGGGGTT GAACGACCCT 

4 51 TTCACAGGGG TCACATATCA GATATCCTGC ATCTTAGCTA TTTACATTAT 
501 GATTCATAAC AGTAGCAAAA TTAGTTAGGA AGTAGGAACA AAATAACGTT 

5 51 ATGGTTGTGG TCACCACTAT GTTAGAGGGT CCGCAGCATT CAGAGGGTTG 
601 AGAACTGTTG TTCTAGAGGC AAATAAGAAG ACAGAGTTCC TTGATAGGGC 
651 CCAGAGGCAG TGAAAGAAGT TTCCACGTAG AAAGTGAAGA AGGTCTGGTG 
701 TCCGAAGCAG TGAGGAACTT AAAAAAAGAA AACCAAAAAC ATTGCCAACT 
751 AACAGTCCAG GAGAAGAGCG GGGCATGAAA GGCTGAGTTC CCATGGGATG 
801 CCTTGAATGG AATCAGAGTG TGGGAAAATT GGTGTGGCTG GAAGGCAGGT 
851 GCCGGGCATC TCAGACGCTG GTAGCTGGGG AAAC AG G AAA CCCCTTTAGG 
901 ATCCCAAGAT GCCATTCCAA TGAGCTTGAG ATTTTTCTCA TGGACTGCCA 
951 GTGAATGTTT CTACGCTCCG GAAATTAATG TTTACTTATT TTCCATATTC 
1001 TAGGGGAGAA CCCTGGGAAA AATGGAGGAC ATTCATTGAA ATATCTGAGT 
1051 CCTGGGATAA GGCAGGCTTG GTCCTACAAC TCTGGTAAAA GTCCATCAGG 
1101 AAGTGCCTTG ACCAAGGCTG GAGTGGAGAG CTGTTGGTGA GATGTAAGGG 

FIG. 4A 
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151 CAAGGTTTAG TTGCTAGATA TGTAGATGGC AAGATGGTGC TGCCAACAGC 

1201 CCCCAGAGCT CTAACCCACT GAGAAACCCA GGAATGAATG ATGGGAGATG 

1251 GCTTTGGTGC CAGCTGCTAG TGACATGGCT GGAAAGCTGC ACTGGCTTCG 

1301 AGGCCAGACA ATTCCTCAAG GAAACATCTG GCCAGGGTGC AAGGGCCAGT 

1351 TTCCTTCCTT GGAGTTCCTT TCACAGCTAA GAACATCATC CCCCAACCAC 

14 01 TGGTTTTGTT AAAAAGTTTT CAGTATGACT TGAGCATGGT CAAGAAGCAT 

14 51 AGAGAGGGGG AAATAAGGGT GGAAGGAGCT GGAGAAAGCT TACAATAGGA 

1501 CTGGGTAAAG GGAAGGAGAA GAAACCATTC CCGCATTCCC ATAGGAGCCA 

1551 GTACCAGGAA GGGCAGGTGT ACACACAGAT CTCATCTAAG GCCATGTTTG 

1601 GTTTAGGGAT TACTCTTCTC CCGAATCTGA GCAGCAGCAA TACGTAAAAT 

1651 ACCCACACCC ATGGCTTCCA TATTCCAGAA CTTATCACAA ACCGTGTAGA 

1701 GTTTACTGAG ATACCTTCGT CAGAGGATGA GTCAGAGGCC TCCTGCCTAA 

1751 GGGCCCTACT GAGCAGGCAG CTAAAGGCTT CCGGGCCTCT GCAGCTCCAC 

1801 AGATACAGGA GAGGGAAGCA GATAAGCCGT GGACTCCACC TGAGCACACC 

1851 TAGCTTGAGC AAAGCTGGTC AGGTACAAAT AGCAGAGGGC TGAATGTCTG 

1901 TGAGCACGCC GCCTGATCCT CTGCTCCACC ACACTCCTGC CGCCATGAAG 

1951 CTCACAGTAA GTCAGATCTT CTTTTCAATG CAGCACCATA CAACATTAAT 

2 001 AGTCAGGGGT GAGGGGGTCT GACTCTTACG GCACTGTTAC CATAGTGGAA 

2 051 ATATTCTCCT TTCTTTTCAT GGAATCATGG TGTTTACAAG CATGTCCATA 

2101 GAGAAGAAGA ATTGCCCCGG AAGAGCCTGT CACAGGCTGA ATACTGTAGA 

2151 ATTGTCTTTC ACACCATCTG TTCCAAGGTT CTACTTAAGA CGAGCAGTCT 

2201 CTGGGCTCCA GAAAGAGTCT TTCTTAGCCT TGATCTCTTT CTTATTTCTG 

2251 ATTTCTCCTT TCTTATCCAT GATTTCCACT TTTACCAGTT CTGGGCATGT 

FIG. 4B 




2301 TCCGGTCAGA CTGGAAGATC 

2351 TGGCTGTTAA CATGAAAACA 

2 4 01 TGGAGAAAGT CTCTGGGGAT 

24 51 TAGGATACAG AGGCTCAAAC 

2501 TATTGGCGCT GCTTTATGAA 

2551 ACTATGACCA AGAAGCGTCC 

2 601 AGCACAGGCC AGAGGGCTGC 

2 651 GTTCTCGACC TTCCTAATGC 

2 7 01 TCGTGACTCC CAGCCATAAA 

2 751 TTTTGCTACC ATTATGAGTT 

2 801 AGATAACCTA AGAAACGGTT 

2 851 GGTGGAGAAC TACTGGTCTA 

2 901 TTACAGGAGA TAACATCATG 

2951 GGGTTGTTTT GTTTTGTTTT 

3001 CTGTCCTGGA ACTCACCTTT 

3051 CCGCCTGCTT CTGCCTCCTG 

3101 GTCTGGCTCA CATCTGGCTT 

3151 GAAAATAAAT ATAGTAGAAA 

3201 ATTCCTTCTG CTAGGAAGAA 

3251 CAGGGCTTTC ATTACACTGT 

3301 AGTTTTCAGT GTCTGAAATT 
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ACTGTTGTCA AAACTAGTCT TCAACACTCT 
ACGGTCCTTG GGCCCTGTGC AAGCATTTCT 
GAAGCTATCT CAGTTTCCCC ACTGAAGTCC 
AGAGTGCACA TATTCAATTT CAGCATACTC 
TCATATGAAT TTATGGAATT GGAAATGTAA 
ACCTCAGAAC AGGTTGGGTG GGGAACTCCA 
GTTTCTCTTC TAGTTCTGTC TAGAGGAGTG 
TGTGACCCTT TAATACAGTT CCTCACGTTG 
ATTACTTTCA TTGCTACTGC ATAACTGTAA 
GTAATGTAAA TATCTGATAT GCAAGATACC 
GTTTGACCTT TAAAGGGGTC ACAACCCACA 
GGGTCCTTTA CAGTCCTTTA GCTGCCTCAT 
CTCAAAAACT CCCTCCACAT TTGGCTTTTT 
TCAAGACAGG GTTTCTCTGT GTAGCCCTGG 
GTAGACCAGG CTGGCCTCGA ACTCAGAAAT 
AGCGCTGGGA TTAAAGGCGT GCGCCACCAT 
TTTAAGAGAC CGATTTTAAC TTCTTGCATT 
TGCTTAACCT ACTAAGACAA TAAAAACAGG 
CACGTTCCAG ACTAAGGAAA AAAACCTTTT 
GCCATGCACT AATTTTATGT TTTCTTCATC 
CAGTGTCAAA ATTCTAAGAC TACATATGAA 
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3351 TATCATTACA GTAACTCAGC AATTCTATGT TACCAGTAAG TTTTTCTGTA 

34 01 GTTTAAAAAA AAGGTGGAAG AAGAAAGCAC AGATAGTTTA GCACATGGGT 

34 51 AAAATCAGTA ACTATTTCTG ATGAGCTTGG TGAAGATGCT GTAAACCATG 

3501 CGACCACCAG TCCTGTTCTC TGTGCTTTCA GATGTTCGTC GTGGGTCTGC 

3551 TTGGCCTCCT TGCAGCTCCT GGTTTTGCTT ACGTAAGTCT CATTTTTCTG 

3 601 AAGTTCATTG TCAAAACTGC ATTTACAGTG AAATGTGATC TTAAGTCACC 

3 651 CTCTGCTTCT TATGAACATT AGACGGTCAA CATCAATGGT AATGATGGCA 

37 01 ATGTAGACGG AAGTGGACAG CATTCGGTGA GCATCAATGG TGTGCACAAC 

37 51 GTGGCCAATA TCGACAACAA TAACGGCTGG GACTCCTGGA ATAGCCTCTG 

38 01 GGACTATGAA AACGTATGTA ATGGACACAC AGGGTAAAGA TATGGTGTAG 
3851 CCACCACCCA TTAAAATTTC TGAGGTGAAT TCTAGCTGTT CATGAACATT 
3901 AAAAGCTACC AGTAAAAGTG CCCATTCCAC TCAAAACAAT TTTACTTTTT 

3 951 TGCATATAAT TATTGCTAAT AAGTATTACA CAATAGGTCG AAATTCAAAG 

4 001 GGATCAATAG TAAGGATAAA AACTATGTAC AAAGACAAAC ACAGCATCCT 
4 051 TTGGTCTTCC CTGCAGAGAG TCTCCATGAT GTTAAAGGTC CAATGTTTTA 
4101 TGGAGGCTGA ATGAAATACG AATGCCTCTG TGATGGAAAA GGCCCAACAT 
4151 CTTATGGAGA ATGAGTGAAG TATGAATGCT ATTAGTTGTA AGAGAAGGCG 
4 201 ATGCAAAGCA ACACTTGGCA CCACCTGCCA ATTACTACTT TCCTATTTAA 
4251 ATGTAGTTTA AAAAGCAAAG CCTGTCTTCC CTGCCTCCTG GAAACACTGC 
4 301 GGATGGAGGT AGACCAAGGT ATGACAGCCT TTAAAAGTTT GTCAGCAAAA 
4 351 CACTCCCCCA TACACACATA CACACACCCT CCTACTACAC TGGAACTGAA 



FIG. 4D 
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44 01 GCAAAGGCAG TGGGTTAGAT ATATCCACCC TCTAAGAGTT TGCAGGTCAT 

4 4 51 CTATATATGA TAGCCAGAGA CACAACTGCA GGACAGCCAG ACTCTGAGCA 

4 501 CTCTCCCCAG CTCCTTGTAG CTCTGTTTCA GTGGTGACTT GTGACAAGAA 

4 551 TCCTGGGGAA CCTGTGCCTC ACTGTTCTCT GTCTTCTTTA ATAGAGTTTC 

4 601 GCTGCCACGA GACTCTTCTC CAAGAAGTCA TGCATTGTGC ACAGAATGAA 

4 651 CAAGGATGCC ATGCCCTCCC TTCAGGACCT CGATACAATG GTCAAGGAAC 

4 7 01 AGAAGGTAAA GTCCTGCCTT CTTCTTTGGA GTGACAGGAA GTCTTACAGT 

4 751 CTCCAGTACA CAGTGAAGTC ACCCCCATTC CCTCTTTGGT GGAGCATGAC 

4 8 01 AGCATGTTTG TCATGATAAA TGCCACAAAC ATGTAAAACT GTTCAGTGTC 

4 851 TGCCTGAATG GAGGGTGGCT TCCACTGTGT CAGATGCCGT GGCCCACATC 

4 901 TGCCTCTGCA GGGTCCAGTA AAGCACTGGC TATCTTGAGT GTCAGAGACC 

4 951 CAAAGGTCTG TACACTTCAG TACAAGCCCT CCATATTTCA AGGGCACACT 

5 001 CCTACAGTCG TTGGGGTTAT CAGAACTAGC AAACATAGAG ACTGGATTTT 
5051 CAGATGAAAA GAAATCCTTT TTAAAGTCTA AGTATGCCTT ATACAATGTT 
5101 TGAGATATTC TCAATACTAA AAAAAAAAAA ATTGTTGCTT GCTTGAAAAT 
5151 CAAATGTAAC CAAGTGTCCT ATATCCAGTG TCAATCATGG CTGTAGTAGA 
52 01 TGGGAAGAGG GAGCCCGTGG TTTTCACAGT CAGACGCCTG AGTTATTCTT 
52 51 CTAAGTGATA AATTGGTTCC TATAACAAGC AAGCCAGTGA ATATAAATAA 
5301 GCTCTATCTC AGAAGTTATC CTGTAGTGCT ACCCTAGAAT CTAAGAGAGC 
5351 AAAAGTGCTT CAAATTTCAG AATAAGTTTT GCTTTGGACT TCTGTTTTTC 
54 01 TAAACAACTA TAACTTCAAA CCATCTAAGC CTCGTGGGAC ACTTAGAAAT 
54 51 ACCAAGCCAT TCAAAGCTAG AATTGTTTCT TCACCTTACT TGAAAACAAA 



FIG. 4E 



5501 ATGACAACCA AAAATTGTCC 

5551 AAGTCCTGGG CTCAGGGATC 

5601 ACTTCAGGGT AAAGGGCCTG 

5 651 CCGTCAACCC TACCAGAGTG 

57 01 GCTGGCATGT GCAGGGGCAT 

5751 TGTGTACCCT GAGATGCTGT 

5801 AGACACTCTA AAGTGTGACC 

5851 GCAAACACTT GCTTGCTCCT 

5 901 CATTAAAAAA AATAGTTCAC 

5951 ACAGCTTCTG CTCTTAGTCT 

6001 GTCACAAAAC TTTTGTCTGG 

6051 TACCGTCTTT CTGGGTGGGT 

6101 AGGAAAGCAT GAGAAGTAGG 

6151 AATCTATCGA TTCTATAGCG 

62 01 TAGTATCCAC AAGGTGTCAG 

62 51 AGATCTCTGA AG T G AAAAG A 

6301 GTAAGAGACA GAGAGGCAGA 

6351 TAAAATTAAT ATATAGTTTA 

64 01 GTCCTAGGAT AGGCACTGAT 

64 51 ACCAGCCTTT GTACTCAAAG 

6501 CTGCGGATGT CCTTCTGTGG 

6551 GGAAAACAAC CCGTGGGCTC 



15/24 

CCACTGCCCT TGTACATCTT CAGATCAGTA 
ATTCACTTTC TTTCTTTCCT TTCACACTCA 
GAGGAGCTCC TCCCAAGGAC TTGATGTACT 
GAGGACCTGA ATACATTCGG ACCAAAGATT 
CCCTACCTAT GTGGCCGAGG AGATTCCAGG 
ATATCCCAAT GCAGTACTGA GAGAGCCATC 
ACAGACGGAC CAATCATGTG GATTATCAGA 
TGTCAGACAG TTGTCCATGC TTCAAAAGTT 
AGGCTCCTCA CAGAAACCTT AGTAGAATCC 
TACTTTTTAG AAACTGAGAC CCAGAGAAAG 
CTCAGGTTCT ATGTCTTTAA CTTTATAGAA 
GGGCTCTAGA GTAAACTTCA AGTGAGTTCA 
GAAGACCAAA TGAAAGGAGA ATGCCAATGA 
CCAATGCTTA ACTCCTAGGC GTTCAAAGAA 
CCTAAGATCC TAATCTAACA GCAAGTTTTC 
GAAAGCAAGA GAGGAACAGA GACAGAAACA 
GACAAAGAGA CAGGGAGAAT AGAGAGGGAT 
GAAAT T ACGA CTCCTCACAG TCCCTGCAGA 
TTGGACTTCT TTTCTTCTCA CTAGGACCAA 
AAGTGCTACA CAGCTGACAT ACTCTGGATT 
AACATCAGTG GAGACATACT AGAAGTCACA 
TGACCATCGC AATGCTTGAT TATGAGAGTG 
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6601 TTCTCTGGGG GTTGTGATTA GCTTCTTTAA GGCTCAATAA ACCCACGTGG 
6651 CAGCACATCC AGTTTGTAAT GACATGCCTC ATGACTTCTA TGGGAGTCCA 



SEP J 22003 8 



67 01 ATGTGGCACC TGCCAGCCTG TATTCAGGAC CTCTCCGCTA TAAAGCATCC 

67 51 CTCCAGAGTT TTCAAATACT ACAAAGCACA GCCTGGGTTT GGGCTCAGAT 

68 01 AGGCCACTGC TGCCTGACTA CAT T AC AG AC AAACAAGTTT TAAAAGAAAG 
68 51 AAAAAAGAGC TCAGAGTGGC TGGAATCAGC AAGGGTGTTT TTCCTGCAAG 
6901 GAG CC AG AAG TATCAATAAT CACCCAAGGA GGAGACACTG GGAATGAGAG 
6951 ACTAGAACAC ACGCCTGCAG ATACGGAGAA CCTCAGCATT GCCGCTCTCT 
7001 CCCATAACTG CACACCCCCT TCTGTAAACT CTGCTTCTTT CTTTCACCTG 
7 051 AAGATGGCCC TTGCTTTTTT TTATTATAGG ACANGATAAC TAGACCAGAA 
7101 AGTCAACCTG ACTCTCTACA TTTATATGTC TTCCCAGNTC AAGAAAT AT T 
7151 ATTTACTGGT GAATGGCACT TCTATATTCC CTTGGTTCAA TAAGTCTACA 
7201 GGATCCATTC ATTGACAGGC CAAGAGTGAG ATCACATGAT ACCCAAGCAC 



7251 ATGGGTCTTT CCTTGAAGGA GAAGGATCCA 
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ATGTTCGTCGTGGGTCTGCTTGGCCTCCTTGCAGCTCCTGGTTTTGCTTACACGGTCAAC 

ATCAATGGTAATGATGGCAATGTAGACGGAAGTGGACAGCATTCGGTGAGCATCAATGGT 

GTGCACAACGTGGCCAATATCGACAACAATAACGGCTGGGACTCCTGGAATAGCCTCTGG 

GACTATGAAAACAGTTTCGCTGCCACGAGACTCTTCTCCAAGAAGTCATGCATTGTGCAC 

AGAATGAACAAGGATGCCATGCCCTCCCTTCAGGACCTCGATACAATGGTCAAGGAACAG 

AAGGGTAAAGGGCCTGGAGGAGCTCCTCCCAAGGACTTGATGTACTCCGTCAACCCTACC 

AGAGTGGAGGACCTGAATACATTCGGACCAAAGATTGCTGGCATGTGCAGGGGCATCCCT 

ACCTATGTGGCCGAGGAGATTCCAGGACCAAACCAGCCTTTGTACTCAAAGAAGTGCTAC 

ACAGCTGACATACTCTGGATTCTGCGGATGTCCTTTTGTGGAACATCAGTGGAGACATAC 
TAG 
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1 MKLTMFVVGL LGLLAAPGFA YTVNINGNDG NVDGSGQQSV SINGVHNVAN 
51 I DNNNGWDSW NSLWDYENSF AATRLFSKKS CIVHRMNKDA MPSLQDLDTM 
101 VKEQKGKGPG GAPPKDLMYS VNPTRVEDLN TFGPKIAGMC RGIPTYVAEE 
151 IPGPNQPLYS KKCYTADILW ILRMSFCGTS VETY 



FIG. 6 
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THf&/ 1 atgcctgact tctcacttca ttgcattggt gaagccaaga tgaagttcac 

51 aattgccttt gctggacttc ttggtgtctt cctgactcct gcccttgctg 

101 actatagtat cagtgtcaac gacgacggca acagtggtgg aagtgggcag 

151 cagtcagtga gtgtcaacaa tgaacacaac gtggccaacg ttgacaataa 

201 caatggatgg aactcctgga atgccctctg ggactataga actggctttg 

251 ctgtaaccag actcttcgag aagaagtcat gcattgtgca caaaatgaag 

301 aaggaagcca tgccctccct tcaagccctt gatgcgctgg tcaaggaaaa 

351 gaagcttcag ggtaagggcc cagggggacc acctcccaag agcctgaggt 

401 actcagtcaa ccccaacaga gtcgacaacc tggacaagtt tggaaaatcc 

451 atcgttgcca tgtgcaaggg gattccaaca tacatggctg aagagattca 

501 aggagcaaac ctgatttcgt actcagaaaa gtgcatcagt gccaatatac 

551 tctggattct taacatttcc ttctgtggag gaatagcgga gaactaa 



FIG. 7 
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1 MKFTIAFAGL LGVFLTPALA DYSISVNDDG NSGGSGQQSV SVNNEHNVAN 

51 VDNNNGWNSW NALWDYRTGF AVTRLFEKKS CI VHKMKKEA MPSLQALDAL 

101 VKEKKLQGKG PGGPPPKSLR YSVNPNRVDN LDKFGKSIVA MCKGIPTYMA 

151 EEIQGANLIS YSEKCISANI LWILNISFCG GIAEN 



FIG. 8 
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> ^Human 1 MKFTI VFAGLLGVFLAPALANYNI DVNDDNNNAGSGQQSVSVNNEHNVAN 50 

Pig 1 MKFTIAFAGLLGVFLTPALADYSISVNDDGNSGGSGQQSVSVNNEHNVAN 50 

51 VDNNNGWDSWNSIWDYGNGFAATRLFQKKTCIVHKMKKEVMPSIQSLDAL 100 

51 VDNNNGWNSWNALWSYRTGFAVTRLFRKKSCI VHKMKKEAMPSLQALDAL 100 

101 VKEKKLQGKGPGGPPPKGLMYSVNPNKVDDLSKFGKNIANMCRGIPTYMA 150 

101 VKEKKLQGKGPGGPPPKSLRYSVNPNRVDNLDKFGKSIVAMCKGIPTYMA 150 

151 EEMQEASLFFYSGTCYTTSVLWI VDISFCGDTVEN 185 

151 EEIQGANLIS YSEKCISANILWILNISFCGGIAEN 185 
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50 



Human 



MKFTI VF . AG LLGVFLAPAL ANYNIDVN.D DNNNAGSGQQ SVSVNNEHNV 



Pig 



MKFTIAF . AG LLGVFLTPAL ADYSISVN.D DGNSGGSGQQ SVSVNNEHNV 



Mouse 



MKLTM.FVVG LLGLLAAPGF A . YTVNINGN DGNVDGSGQQ SVSINGVHNV 



51 



100 



Human 



ANVDNNNGWD SWNSIWDYGN GFAATRLFQK KTCIVHKMNK EVMPSIQSLD 



Pig 



ANVDNNNGWN SWNALWDYRT GFAVTRLFEK KSCIVHKMKK EAMPSLQALD 



Mouse 



ANI DNNNGWD SWNSLWDYEN SFAATRLFSK KSCI VHRMNK DAMPSLQDLD 



101 



150 



Human 



ALVKEKKLQG KGPGGPPPKG LMYSVNPNKV DDLSKFGKNI ANMCRGIPTY 



Pig 



ALVKEKKLQG KGPGGPPPKS LRYSVNPNRV DNLDKFGKSI VAMCKGI PTY 



Mouse 



TMVKEQK . . G KGPGGAPPKD LMYSVNPTRV EDLNTFGPKI AGMCRGIPTY 



151 



188 



Human 



MAEEMQEASL FFYSGTCYTT SVLWIVDISF CGDTVEN 



Pig 



MAEEIQGANL ISYSEKCISA NILWILNISF CGGIAEN 



Mouse 



VAEEIPGPNQ PLYSKKCYTA DILWILRMSF CGTSVETY 
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